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Abstract —In this paper, we consider communication over the 
bandwidth limited analog white Gaussian noise channel using 
non-orthogonal pulses. In particular, we consider non-orthogonal 
transmission by signaling samples at a rate higher than the 
Nyquist rate. Using the faster-than-Nyquist (FTN) framework, 
Mazo showed that one may transmit symbols carried by sine 
pulses at a higher rate than that dictated by Nyquist without 
loosing bit error rate. However, as we will show in this paper, 
such pulses are not necessarily well localized in time. In fact, 
assuming that signals in the FTN framework are well localized 
in time, one can construct a signaling scheme that violates the 
Shannon capacity bound. We also show directly that FTN signals 
are in general not well localized in time. Therefore, the results of 
Mazo do not imply that one can transmit more data per time unit 
without degrading performance in terms of error probability. 

We also consider FTN signaling in the case of pulses that 
are different from the sine pulses. We show that one can 
use a precoding scheme of low complexity to remove the inter¬ 
symbol interference. This leads to the possibility of increasing 
the number of transmitted samples per time unit and compensate 
for spectral inefficiency due to signaling at the Nyquist rate of 
the non sine pulses. We demonstrate the power of the precoding 
scheme by simulations. 

I. Introduction 
A. Background and previous work 

In 1949, Shannon 11| presented his famous result on the ca¬ 
pacity of the additive white Gaussian noise (AWGN) channel 
for band limited signals. The result is based on communication 
using orthogonal pulses (to avoid inter-symbol interference) 
which corresponds to transmission at the Nyquist sampling 
rate. 

In |2j, Mazo considered the uncoded transmission case 
for binary symbols, where the sample rate is faster than 
the one dictated by the Nyquist sampling theorem, so called 
faster-than-Nyquist (FTN) signaling. The minimum Euclidean 
distance between pairs of binary signals was used as a per¬ 
formance measure, and it was shown that one can transmit 
the signal pulses faster than the Nyquist frequency without 
decreasing the Euclidian distance between any two signals. 
The limit for such rate increase is known as the Mazo limit: 
p ~ 0.802, and was derived in 0 , 0 - Recently, a series of 
work has explored FTN signaling, see ||5| and the references 
therein. This work has also been extended to two dimensions, 
the second dimension being the frequency domain [6|. The 
signals are packed tighter also in the frequency domain, pos¬ 


sibly introducing interference between previously uncorrelated 
subcarriers. However, this inter-carrier interference does not 
affect the reliability of the signaling with a certain packing 
density, and with the use of an optimal detector the error 
rates would remain unchanged. Also, in |[7J it was shown that 
FTN can achieve the maximum capacity for a given pulse (as 
opposed to orthogonal transmission). This work also considers 
the use of FTN for achieving higher capacity for non-sinc 
pulses when the code alphabet is finite. 

The original derivation of Shannon’s capacity formula [1| 
is based on transmission of uncorrelated sine pulses. These 
signals have infinite support in the time domain and hence, as 
pointed out by Wyner in |8j, “the idea of transmission rate 
[using band limited pulses] has, at best, a limited meaning.” To 
treat this problem in a rigorous manner, Wyner proposed sev¬ 
eral physically consistent models with corresponding coding 
theorems, thereby justifying Shannon’s capacity formula |8|. 
Also, in the FTN framework, the concept of transmission rate 
is problematic and there is no guarantee that a signal consisting 
of a linear combination of pulses centered at a given time 
interval has its energy localized in the vicinity of that interval. 
In fact, as we will see in Section [III] one can easily construct 
examples where this is not the case. Another problem in the 
FTN framework is that the algorithms used for detection and 
estimation suffers from high complexity, rendering NP-hard 
problems in general (9J, [lOj. 

B. Contribution 

We consider communication over the bandwidth limited 
analog Gaussian white noise channel using non-orthogonal 
pulses. In particular, we consider non-orthogonal transmission 
by signaling samples at a rate higher than the Nyquist rate. Us¬ 
ing the faster-than-Nyquist (FTN) framework, Mazo showed 
that one may transmit symbols carried by sine pulses at a 
higher rate than that dictated by Nyquist without loosing bit 
error rate. However, we show in this paper that such pulses are 
not necessarily well localized in time. In fact, assuming that 
signals in the FTN framework are well localized in time, we 
show that one can construct a signaling scheme that violates 
the Shannon capacity bound. We also show directly that FTN 
signals are in general not well localized in time. Thus, it’s not 
physically correct to talk about bits per second, as the energy 
of non-orthogonal signals may not be well localised in time, as 


opposed to orthogonal sine pulses, for instance. Therefore, the 
results of Mazo |Zj do not imply that one can transmit more 
data per time unit without degrading performance in terms of 
error probability. 

We go on and consider FTN signaling in the case of pulses 
that are different from the sine pulses. We show that one 
may use a precoding scheme of low complexity in order 
to remove the inter-symbol interference. This leads to the 
possibility of increasing the number of transmitted samples 
per time unit (keeping average energy per time unit bounded 
by some constant) and compensating for spectral efficiency 
losses due to signaling at the Nyquist rate, and so, achieve 
the Shannon capacity when the same energy is spent for 
transmitting with the ideal sine pulses. We demonstrate the 
power of the precoding scheme by simulations. 

II. Preliminaries and Properties of 
Non-Orthogonal Pulses 

To start, we introduce some basic notation. Let N denote the 
set of natural numbers {0,1,2,...}, R the set of real numbers, 
and C the set of complex numbers. The nxn identity matrix 
is denoted by I n . X ~ Af(0, X) denotes that X is a Gaussian 
variable with E{X} = 0 and E{XX^} = X. We use X >- 
0 (X y 0) to denote that the matrix X is positive definite 
(semidefinite). X T is the transpose of a matrix and X* is 
the conjugate transpose. The empty set is denoted by 0 and 
int(fl) denotes the interior of the set Cl. The floor function 
|_xj denotes the greatest integer less than or equal to x. We 
define the normalized sine function as 
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Definition 1 (Signal Space). Let £ 2 (R) be the Hilbert space 
of functions / : R — > C that are square integrable, endowed 
with the scalar product 

/ OO 

-OO 

and squared norm 

/ OO 

\f(t)\ 2 dt < oo. 

-OO 

Definition 2 (Fourier Transform). The Fourier transform of a 
function / £ £ 2 (®0 is given by 

/ OO 

me-^dt. 

-OO 

Definition 3. £i([— 7r, 7t]) is the Banach space of functions 
/ : R — X C that are absolutely integrable, with the norm 

ll/ll = f \f(t)\dt < oo. 


Definition 4 (Gaussian White Noise Process |TT|). The 
stochastic process Z(t) is a white Gaussian noise process if 
Z(t) has a constant spectral density over all frequencies. 


Proposition 1. Let Z(t) be a zero mean white Gaussian noise 
process with spectral density 4^-. For g k ,ge £ £2(®0, define 


Z k = ( Z,gk ) 



Z{t)g k (t)dt 


and 


POO 

Zt = (Z,gt)= Z(t)ge{t)dt. 


J —OO 

Then, Zand Z\ are zero mean Gaussian variables with 
covariance 


E {Z k Z t } = ^ j g k (t)ge(t)dt. 

Proof: See ED- ■ 


Proposition 2 (Shannon Capacity of the Discrete Memory-less 
Gaussian Channel). Let {Z k }^ =1 be a set of independent 
Gaussian variables with Z k ~ 7V"(0, N f ) and consider the 
Gaussian channel with input sequence of samples {X k }^ = i 
and output 


Y k = X k + Z k , for k = 1,..., n. 


Suppose that 


£>|A ' fc | 2 <nE s . 


fc= i 


Then, the capacity of the channel is given by 

2 E„ 


C = log 2 1 + 


N n 


bits/sample 


and is achieved for X k ~ Af(0,E s ). 

Proof: See ED- ■ 

Definition 5 (Toeplitz matrix). A matrix T n is called Toeplitz 
if it has the following form 


T = 

-T r). - 
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Definition 6 (Associate function). Let T n be a Toeplitz matrix 
according to Definition [5] A function / £ C \ ([—tt, 7t] ) is 
called an associate function to T n if f(z ) is associated with 
the corresponding Fourier series of the matrix elements i.e., 
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where 


Ck ~ 2tt 


k ——oo 


— ikz 


f{z)dz. 


The matrix is sometimes denoted T n (f), and in the infinite 
case we write T{f). 








The notation is motivated by the fact that the associated 
function determines the distribution of the eigenvalues of the 
matrix. To state this, we need the following definition. 

Definition 7 (Equal distribution). Let M be a constant in R 
and 0 < n £ N. Also, let {c4”^}”=i and { l)'/' 1 }™ =1 be sets in 
R such that 


Proposition 5 (Diagonalizing a Circulant matrix). Let C n be 
a Circulant matrix. Then, 

c n = UA n U*, 

where A n = diag(Ai, A 2 , ..., X n ) and U is the Fourier 
matrix: \U] m ,t = e~ 2mrne ^ n for m, i = 0 , 1 , ..., n — 1 . 


|a^| < M, \b^\ < M, for all 1 < £ < n 

We say that {a ^"^}” =1 and 1 are equally distributed 

on [— M, M] if for any continuous function 

F : [-M, M] ->• R, 


we have that 


lim Eti F{a™)-F{b™) 


n—> 00 Tl 

Proposition 3 (Spectrum of a Toeplitz matrix). Let T n be 
a Toeplitz matrix with associated function f. Also let AjE, 
£=1,2 be the eigenvalues of T n . Then, 


inf f(z) < X ( e n) < sup f(z). 

2 2 


Proof: See © and ©• ■ 

To be concise and to avoid ambiguities regarding the 
simulations we also define the following words: 

Definition 9 (Payload bits). A payload bit is a bit realized 
from some input distribution. This bit represents pure data 
that we want to communicate and it is on sets of these that we 
compute block-error rates (BLER) and bit-error rates (BER); 
these are also used in the measure of throughput. 

Definition 10 (Physical bits). A physical bit is the bit phys¬ 
ically transfered over the channel. A physical bit represents 
some type of coded realization of a payload bit. The number 
of physical bits, denoted ^physical bits, is 

#payload bits = ^physical bits • code rate. 


Moreover, the sets 



are equally distributed. 

Proof: See © or ©■ 


When measuring signal-to-noise ratio (SNR), it will be done 
with respect to physical bits. 

Now introduce the scalar products 


Hkt — 


(hk,h e ) 



h k {t)hi(t)dt 


( 1 ) 


Proposition 4 (Associate function of the square root inverse). 
Let T n be a Toeplitz matrix with associate function f and let 
f(z) > 0 for all — 7 T < z < w. Moreover, let \Jf be such that 
its Fourier series has absolutely summable coefficients (that is; 
y/f is in the Wiener class 114]). Now let K n be the inverse 
of the hermitian square root of T n , 


and define the matrix H whose elements [H\m = H)~e for 
k, £ = 1,..., n. The matrix H is the Gramian of the functions 
h \,..., h n . 

Proposition 6 (Positive Definiteness of the Gramian Matrix). 
Let hi ,..., h n £ £2 (R) and consider the Gram matrix 


K n = ((T„)5) _1 . 

Then, K n is asymptotically Toeplitz in the sense that there 
exists a Toeplitz matrix r„ such that for D n = K n — r„ 

lim - Tr (D* n D n ) = 0. 

n—f 00 Tl 

Moreover, the associated function to K n is given by 1 / y/ f(z) 

Proof: If y/f is in the Wiener class, then so is /. The 
proposition now follows from application of Theorem 5.3 (c) 
and Theorem 5.2 (c) in ©■ ■ 
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where Hki = ( hk,h(). Then, H A 0. Furthermore, H is 
invertible if and only if hi, ...,h n are linearly independent. 

Proof: See |l 6 j pp. 12-13]. ■ 

We will consider Mazo’s model |2j consisting of a finite 
number of modulated sine pulses transmitted at a rate that is 
higher than the Nyquist rate. Let T = /y and 


Definition 8 (Circulant matrix). A matrix C n is called Circu¬ 
lant if it has the following form 
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g(t) = V2W ■ sine ( 2 Wt) = Vf 
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7 rt 


Introduce 


hk{t) = g(t- p 


k- 1 
2 W 


for p £ (0,1]. For any real number r, we have that 


It is thus a special case of a Toeplitz matrix. 


g(t - t) A G[u))e~ iuT 















Figure 1. Signal of the form j2j where p = 0.81, n = 20, and T = 1 with Figure 2. Top: sine pulse centered at f = 6. Middle: Basis functions for 
more than 50% of the energy is outside the interval Q = [—15,34], p = 1/4, n = 4, and T = 1. Bottom: Optimal £ 2 (K) approximations for 

pe {1/2,1/3,1/4}. 


and so the spectrum of g(t — r) is invariant under (time) 
shifting since 

|G( W )e—| = |G( W )|. 

Now, let X(t) be a signal consisting of m = \n/p\ samples 
transmitted with a rate that is faster than Nyquist (that is p < 
1), given by 


m 

X(t) = J2 A khk(t) ( 2 ) 

k=i 

where { A /,.} are real valued random variables. It’s faster than 
Nyquist in the sense that the pulses are spaced in-between with 
a time distance pT instead of T. Thus, the sample density is 
~pT = ~[T samples/second. 

Lemma 1 . The functions 

k = 1, are linearly independent. 

Proof: See the appendix. ■ 


III. Time Localization and FTN 


A signal is well localized to a time interval fl C K. if the 
signal energy outside the interval is small compared to the 
total signal energy, i.e., if 


x D (x,n) 


fn x ( t ) 2dt > , _ |; 
ITOII 2 - M 


(3) 


where p is close to zero. 

An implicit assumption when considering signals X(t) of 
the form ([2} is that the signal is well localized in time to 


the interval f2 = [0, (n — 1)T] (that is, approximately time- 
limited). However, in the faster than Nyquist framework, this 
assumption is violated and as we will see one can create 
signals with a considerable proportion of its energy content 
outside this region. This implies that signals of the form 
0 with p < 1 may not satisfy even for moderate size 
p and intervals Q that contains [0, (?z — 1)T], for example 
f l = [—M, (n — 1)T + M], for some M > 0 (e.g., as depicted 
in Figure [T] with M = 15). 

To see another example of this, consider pulses consisting 
of the (non-orthogonal) basis functions g(t — p(k — 1)) for 
k = 0,... ,m = [4/pJ for a given p < 1 and where W = 
1/2. Figure [ 2 ] shows the best quadratic approximations of 
the sine pulse g(t — 6) for p € {1/2,1/3,1/4}. By Lemma 
|T] such approximation is not exact, but the approximation is 
very close for p = 1/4. This example shows that using the 
faster than Nyqvist framework with small p, one can create 
signals with support outside the designated “time slot” of the 
signal, for example approximating a sine pulse centred outside 
the interval [0, (n — 1/7']. In fact, it is possible for each fixed 
n > 0 to approximate any band limited signal as a faster than 
Nyqvist pulse if the rate p is sufficiently small. This is the 
content of the following proposition. 

Proposition 7. Let n > 0 be fixed. Any £2 (R) function whose 
Fourier Transform is restricted to the frequency band [-W, W] 
can be approximated arbitrary close (in £ 2 ( 11 )) by a function 
X(t) of the form §2^ for some positive number p < 1. 

Proof: See the Appendix. ■ 

It is possible to construct examples where the energy is not 
localized to the interval [0, n — 1] also for p larger than the 














































FTN signal, Sine with rho= 0.9 




Figure 3. Maximal energy outside = [— m, m + n — 1] as a function 
of m, of a signal {2} where p E {0.7, 0.81, 0.9,1} and where n = 20. 


Mazo limit. Consider the problem of determining the maximal 
energy outside the interval fl m := [—m, m+n— 1 ] for a signal 
X(t) of the form Q, i.e., 

max / X(t) 2 dt subject to ||X(f )|| 2 < 1 

x ( t ) J R\n m 

m 

subject to X (t) = E A k h k (t) 
k= 1 

Note that this optimization problem may be written as an 
eigenvalue problem 

max A J (H — H(Ct m ))A subject to A 1 HA < 1, 

where 

H{Q m )k,e= [ h k (t)he(t)dt, 

J n m 

and the maximum corresponds to the largest eigenvalue of the 
matrix I — H~ 1 H(Cl m ). Figure ([3) shows how the maximal 
energy outside fl m depends on m where p £ {0.7,0.81, 0.9,1} 
for the case n = 20. Figure 0 shows an example where 
p = 0.81 and more than 50% of the energy is outside the 
interval U 15 = [—15,34]. 

This effect may also be seen when the alphabet is con¬ 
strained to a discrete set of points, e.g., for binary symbols. 
To illustrate this, let the signal be of the form 0 where 
Ak £ {—1,+1} with T = 1, p = 0.9, and rn = 400. The 
signal corresponding to Ak = (—l) fc is depicted in Figure [ 4 ] 
The darker region of the signal, between the vertical bars, is 
the time \—p, pin] where the signal energy is supposed to be 
localized. However, as can be seen a considerable proportion 
of the energy falls outside this region. In fact numerical 
integration shows that about 75% of the energy is confined in 


Figure 4. This figure shows the resulting signal when a Sine-pulse is 
transmitted with a completely alternating sequence of A & = (—l) fc for 
k = 1,..., 400. The darker part of the signal, surrounded by vertical bars, 
is the part t = —p and t = 400 p respectively; that is one symbol-time 
before/after the first/last of the main peaks. 


the interval t £ [—p,pm], and only about 27% to the interval 

t £ [0, p(m — 1)]. 


IV. Time Localization and capacity of FTN 


Next, we examine the FTN model from a viewpoint of chan¬ 
nel capacity. Assuming that the FTN model (|2| only contains 
signals that are well localized to the interval [0, (n — 1)T], we 
will see that the Shannon capacity can be violated. Again this 
illustrates that the FTN model contains signals that are not 
well localized in time. 

The energy of a signal given by 0 is 


E|| A'|| 2 = E 


^ Akhk 

k =1 


= E / A k hk, yy Ache 


\ k =i 


e=i 


= eEE HkcA k Ac 

fc =i 


= E A T HA. 


Let E s be the energy per sample in the case of p = 1. We will 
restrict the energy to not exceed the energy for the Nyquist 
signaling case, that is, the same as the energy for signaling n 
samples. For p = n/m, we have 

Ti 

E A 1 HA = mpE s = —pE s = nE s 
P 

which gives an average energy per sample to be 

— E A 1 HA = pE s . 
m 



















Since the time duration between consecutive samples is pT, 
the time spanned of m samples is mpT. Now if the non- 
orthogonal pulses were well localized in time, we would get 
over an infinite time horizon the average power 


lim -— 

m-¥oo mpT 


E A 1 HA = — = P. 

pT 


(4) 


Thus, the average power does not exceed that of the Nyquist 
signaling case. The received signal over the AWGN channel 
is thus given by 


Y(t) = X(t) + Z(t) 


where Z(t) is a zero mean white noise Gaussian process. We 
define the measurements 


Y k = (h k ,Y) 

= ( h k ,X + Z) 

m 

= (^^5 'y ^ A-ihi) 4 - Zk 

1=1 

m 

= hi) At + Z k 

t =l 
m 

= H k (A( + Z k . 

t -1 

We may write 0 in the more compact form 


(5) 


Y = HA + Z. 


(6) 


It’s not hard to check that { Y k } is not a sequence of indepen¬ 
dent variables since {h k } is not an orthogonal set of functions. 
The covariance of Z k and Zf is given by 

E{Z k Z e } = ^(h k , h e ) = ^H m . (7) 

The functions {h k } are linearly independent according to 
Lemma [l] and Proposition [h] implies that H >- 0. Let H 5 y 0 
be the unique positive definite matrix such that H? H? = H. 
Then, we may write 

Z = H*V, V~Nf(0,^I m ) 

and ([6]) is equivalent to 

Y = HA + H^V. (8) 

Since H is positive definite, it’s invertible, and so is H i. 
Multiplying the left and right hand side of Equation ([8} by 
H~i gives 

S = H~^Y = H^A + V. (9) 

Now let X = fTs A, using the precoding A = H~^X. This is 
always possible since H is nonsingular. The energy constraint 
on X is then 


^E|X fc | 2 = E|A'| 2 = EA J HA = pmE s . 

k -1 


The channel capacity over the discrete time channel 

N n 

S k = X k + V k , V k ~AT(p,-£-), k = 1, ..., m, (10) 
with a sequence { X k } of total energy 

m 

^E|X fc | 2 = pmE s 
k =1 


is maximized for a sequence {X k } of independent identically 
distributed Gaussian variables with X k ~ J\f(0,pE s ). Thus, 
the capacity of m samples is 


C - = m \ log, (l + ^L) 

= ”' 5‘ 0fo ( 1 + j^) “ K - 

Recall that the sample density is samples/second. Then, 
the capacity per sample is C m /m and the average capacity in 
bits per second as time goes to infinity is 


C(p) 


lim 

m—> oo 


2W _ Crn 
p m 




bits/second. 


From the derivations above, we see that Mazo’s model 
implies that the channel capacity in fact increases as we pack 
signals tighter in time, that is when p decreases(note that 
p = 1 renders Shannon’s result for orthogonal pulses). This 
rather unexpected (and false) result may be explained away 
by considering the energy localization of signals in time. The 
signals are not well localized in time and the limit in the 
expression of the average power in ([4]) is not equal to P but is 
equal to a smaller number since the energy could be arbitrarily 
small over a time period mpT. 


V. FTN for Non Ideal Pulses 
A. FTN Gramian and spectrum 

We will show that FTN can be applied to make good use of 
spectrum leakage, when pulses with discontinuous spectrum 
are difficult to use. The following result is similar to results 
that can be found in (7), but provides additional insight since 
the associated function is explicitly presented, something we 
will make use of later. 


Proposition 8 (Toeplitz- and pulse-spectrum). Let hit) € 
£2(R)> and 

h k (t ) = h{t - r k ) A fT(w)e _l “ Tfc k = 1,..., n, 

with the restriction that r k = r • (k — 1) for some constant r € 
R + \{0}. Furthermore, let H n be such that [H r ,\ k j = ( h k , hf) 
Then, 

i) H n is a Toeplitz matrix T n {f) . 

ii) The associated function f £ £i([— 7r, 7r]) is given by 


m 



H 


(^) 


2 









where H(oj) is the Fourier transform of h. This holds for 
all z £ [—7r, 7r] except possibly at a set of measure zero. 

Proof: See the Appendix. ■ 

What this proposition says is that the Gram matrix generated 
by a set of systematically shifted pulses is in fact Toeplitz, 
and the associated function / is the folded spectrum of 
the pulses. Given a closed form of the Fourier coefficients 
Ck-\ = [H n \k i we can always try to calculate / directly. 
However, Proposition [8] gives a more systematic and general 
framework to it. From this result we can for example see 
that a transmission that is free from inter-symbol interference 
needs to have a folded spectrum that is constant, something 
that is well known in the literature gg, (F7), (18). 

B. Precoding and capacity of the precoded signal 

Consider a pulse shape g{t) £ £ 2 ( 11 ) with Fourier transform 
G(w) where G(w) 0 for |w| < 2-kW' and G(w) = 0 for 
|w| > 2-kW' . If the transmitted time-shifted pulses g(t — kT ) 
are to be orthogonal, the time shift T must be larger than 
T' = 2W7, unless g(t) is a sine pulse [18]. However, by 
using the precoding described in Section [fV| we can transmit 
the non-orthogonal pulses h k (t ) = g(t — kT') and get an 
easy detection algorithm (see Section |IV| ). Then, the channel 
capacity becomes 

C = W' log 2 (1 + ——- ) bits/second, 

2 V N 0 W'J 

which is exactly the capacity for sine pulses with average 
transmit power P and bandwidth W'. To use this result we 
will have to show that the signal X(t) = A k hk(t) is well 
localized in time when n is large enough. 

The time localization of (a large number of) sine pulses 
transmitted at the Nyquist rate is well established (as defined 
and shown in |8]), and thus, we will show that as the number 
of pulses goes to infinity, the precoded signal approaches a 
train of sine pulses transmitted at the Nyquist rate. 

Theorem 1 (Time localization of precoded signals). Let g(t) £ 
be a given pulse, and let its Fourier transform G(u>) 
be such that |G(w)| is piecewise continuously differentiable, 
G{<jj) 0 for |u;| < 2ttW', and G(oj) = 0 for |w| > 2'kW'. 
Introduce 


and 

hk(t) = g(t - kT 1 ) . 

Let H 2 n +i be a (2n+l) x (2n+l) matrix where [H 2 n+i]k,i = 
(hk, hf), k,l = —n ,..., n, X £ K 2ra+1 , A = H 2r f +1 X, and 
X{t) be given by 

n 

X(t) = J2 A k h k(t) . 

k——n 

Then, X{t) is well localized in time. 

Proof: See the Appendix. ■ 


White Gaussian noise 


Z(t) 



Input data 
X 


Decoded samples 
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Figure 5. Block diagram of the precoding and decoding; with transmis¬ 
sion over an AWGN-channel. From independent discrete input data X to 
independent discrete samples S 


C. Complexity and implementation 


In general the problem of estimating the transmitted se¬ 
quence is NP-hard (9j in the case of inter-symbol interference 
and usually, one has to rely on the Viterbi algorithm which 
has exponential complexity m- 

However, the precoding scheme that has been suggested in 
Section IV reduces the problem to an independent estimation 
for each sample, which is a much easier problem. A block 
diagram of the process can be found in Figure [5] Starting 
from the transmitter side we take the discrete data X as input. 
This is precoded using A = H~ 1 / 2 X and sent through a pulse 
shaping filter 0 over the AWGN-channel. On the receiver 
side this is sampled using a matched filter, these samples 
are decoded to give the discrete output samples S, which 
according to ([TO]) are decoupled such that ,S), : only depends 


on X k . 

For many cases the algorithm can be implemented effi¬ 
ciently for large n utilizing the fact that the Toeplitz matrix 
H is asymptotically circulant For such cases, H can 

be approximated by a circulant matrix and the algorithm can 
be well approximated by an implementation using FFT with 
complexity O (nlog(n)). Note that if p is selected to be too 
small, then the eigenvalues of the Toeplitz matrix H will also 
be small and this approximate approach will break down. The 
implementation used in following subsection is done using 
both this method and an implementation based on full size 
matrices, and the results are identical for all the simulations. 


D. Example - Root-Raised-Cosine 

We exemplify the theory by applying it to the root-raised- 
cosine pulse, from this it is also easy to see how it works for 
other pulses. 

As before, let T = ^ 7 , and introduce 


9p(t) 


4/1 cos ((1 + /3)tr)p) + Sm((1 4 4 Vr) 
ks/T 1 - ( 4/?£) 2 


HD 


















defined for p £ [0, 1]. Let the Fourier transform of gp(t) be 


Vt 

Gp(u) = O 


0 


0<M<(1-^)- 

< M < (1+^)^ (12) 
M>(l + /3)^ 


with 



(13) 


We can see that the pulse is a roll-off pulse that is defined 
with a spectrum leakage p, the transform has support in linear 
frequency on an interval of size W' = W (1 + 0). It is also 
a Nyquist pulse, since with T as chosen above the pulses are 
orthogonal. When applying FTN to the root-raised-cosine we 
arrive at the following result. 


Lemma 2 (Associated function to the Root-raised cosine 
Gramian). Let h k {t) = yfp ■ gp with g p given 

by (FT). k = 1,2,... ,n, and p < 1. Moreover let //,, be the 
Gramian of these pulses as given by Proposition [6] Then the 
associated function / to H n is given by 


with 


/(*) 


{B) If (1 + p)p < 1 
CD if (i + P)p > i, 


(14) 


0 


1 ( 

1 +“»('»)) 

1 


i( 

1 *>(W)) 

0 



2 e [—7T, -(1 +/I)p7r] 

2 e [-(1 + P)pn, -(1 - P)pn\ 
z £ [-(1 - P)pit, (1 - P)pw] 
z e [(1 - P)pn, (1 +/3)H 
2 e [(1 + P)ptt, tt] 

(15) 


1 - sin (^w 1 ) cos (w) 

z e [-7T, -(2 - (1+ P)p)tt] 

z e [-(2 - (1 + /3 )p)tt, -(1 - P)ptt] 

z £ [-(1 - P)pn, (1 - P)pn] (16) 

z £[( 1 - P)pn, (2 - (1 + P)p)tt) 

cos (w) 

z £ [(2-(1 + /3 )p)tt, tt]. 


Proof: See the Appendix. ■ 

By inspecting Lemma [2] in the context of Proposition [3] we 
can see that we only have a numerically well behaved Gramian 
matrix and numerical stability if (1 + ft)p > 1. This is indeed 
also true for all roll-off pulses that utilize extra bandwidth P, 
which can be concluded from Proposition [8] 

We can see that with p = and j 1 ' = pj 1 , the root- 
raised-cosine fulfills all prerequisites in Theorem [I] Hence, 


for a fixed average power P the capacity for precoded FTN 
with root-raised-cosine becomes 

C = (1 + 0)W log 2 (l + NoW ( 1+f}) ) bits/second. 

E. Simulation results 

As a proof of concept, we have conducted simulations where 
the suggested precoding is applied and implemented, using 
FFT approximation as described above. These simulations are 
based on the model presented in (|8j. 

We apply the root-raised-cosine function as a pulse shape 
and use a roll-off factor p = 0.22 to mimic existing standards 
as in (DD- We are only looking at binary input-output and 
don’t use any higher order modulation. 

These simulations are such that we keep the time for a 
block constant, with the reference that in the Nyquist case 
(p = 1) one block should be 4000 physical bits. The Nyquist 
transmission also serves as a reference in the sense that these 
pulses are of unit energy. We are keeping the power constant 
for all the schemes, meaning that FTN must use a lower energy 
per physical bit. This in turn relates to the SNR given in the 
plots, as this is a Nyquist reference type of SNR in order to be 
able to make a fair comparison between the different schemes. 
The SNR is given in dB and relates to the standard deviation 
of the sampled noise as 

G = 10 ~ SN W2° ; 

since Nyquist transmissions applied unit energy per physical 
bit. This standard deviation is also used for FTN transmissions 
regardless of the fact that FTN is using lower energy per 
physical bit, so the experienced SNR per physical bit will 
be worse than what is actually written on the axes in the FTN 
case. This comparison is motivated since it allows us to judge, 
for a given channel state, if it is worth changing a Nyquist 
scheme for FTN. 

In addition, this simulation also applies (WCDMA) turbo 
codes according to |j20jj. The coding is applied on the payload 
bits to create the physical bits, X, on which we then apply the 
precoding TT -1 / 2 . Similarly at the receiver we fist apply FTN 
decoding, to produce S, before using turbo decoding which 
gives the estimated payload bits. For every value of p and 
SNR presented, we have looked at code rates from 1/3 up to 
« 0.96 in a grid containing 18 points (exact code rates are 
rounded due to finite block length). From this, we have then 
computed the throughput as: 

throughput = (1 - block-error rate) • ^payload bits, 

and selecting the code rate giving highest throughput. The 
throughput is given as bits per equivalent time unit and the 
reference is, as mentioned, the Nyquist scheme transmits 4000 
physical bits, the exact bandwidth is then just a scaling from 
this. 

Another way of calculating the throughput, although less 
attractive for practical purposes, is to rely on the bit-error rate 
instead of the block-error rate. The resulting throughput can 
be found in Figure [7] It should however be noted that this 

















VI. Conclusions 


Throughput based on BLER ploted versus SNR. 
The equivalent Nyquist block consists of 4000 bits 



Figure 6. Throughput as given by throughput = (1 — block error rate) • 
#payload bits. One can observe that the FTN scheme seems to need a higher 
SNR in order to reach better throughput, and the lower the p, the higher the 
SNR. However, once this is reached it starts to outperform the Nyquist case. 
The stagnation in throughput at high SNR is due to the code rate not going 
higher than « 0.96. 


Throughput based on BER ploted versus SNR. 
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Figure 7. Throughput as given by throughput = (1 — bit-error rate) • 
#payload bits. This plot shows throughput gains using FTN even at low 
SNR, although this is less informative from a system perspective. The same 
stagnation in throughput at high SNR as in Figure [6] can also be seen here, 
we have reached BER = 0. 


optimization resulted in all schemes using the highest possible 
code rate for all SNR. 


We considered communication over the analog white Gaus¬ 
sian noise channel using a finite bandwidth [-W, W] and lion- 
orthogonal pulses by signaling at a rate that is higher than the 
Nyquist rate. We showed that the conclusions in j2j, that one 
may transmit symbols carried by sine pulses at a higher rate 
than that dictated by Nyquist without loosing in bit error rate 
don’t imply that the bit error rate per time unit decreases. This 
was demonstrated by showing that if the model in 0 is valid 
to consider bit error rates per time unit, then it means that 
non-orthongal signals may achieve a capacity for the AWGN 
channel that is higher than the Shannon capacity. We explain 
this phenomenon by means of an example where we show 
that non-orthogonal signals do not give rise to well localized 
energy in time. Thus, it’s not physically correct to talk about 
bits per second, as the energy of non-orthogonal signals may 
be more spread over time. Therefore, the results of Mazo 0 
do not imply that one can transmit more data per time unit 
without degrading performance in terms of error probability. 

We also considered FTN signaling in the case of pulses that 
are different from the sine pulses. We showed that one may 
use a precoding scheme of low complexity, in order to remove 
the inter-symbol interference. This leads to the possibility of 
increasing the number of transmitted samples per time unit 
and compensate for spectral efficiency losses due to signaling 
at the Nyquist rate of the non sine pulses. Thus we can 
achieve the Shannon capacity when the same energy is spent 
on transmitting with the ideal sine pulses. 
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for all oo £ int(fl). Since e aiU1 ,..., e ak ~ lU are linearly 
independent by the induction hypothesis, we must have C\ = 


= Ck-i = 0. But this contradicts (17 1 since Ck 7 ^ 0. We 


conclude that our assumption that e° 


are linearly 


dependent was false, and linear independence for n = k — 1 
implies that for n = k. Thus, e aikJ , ..., e anU1 are lineary 
independent for any positive integer n, and we are done. ■ 
Next, we will prove Lemma [I] by contradiction. Suppose 
that hi,...,h n are linearly dependent. Then, there exists 0 f 
(ci, c n ) £ C n such that 

c\hi(t) + ■ ■ ■ c n h n (t) =0, Vt £ R. 

Taking the Fourier transform of the equation above, we get 
ci H{uj)e- iwTl + ■ ■ ■ + c n H (uo)e~ iuiTn =0, Vw £ R. 

In particular, we have that 

ci ff(w)e"“ Tl + • ■ ■ + c n H{oo)e~ iuT - = 0, 


Appendix 

A. Proof of Lemma [7] 

To prove the statement, we need the following result. 

Proposition 9. Let LI C R with int(f2) 7 ^ 0 and ai, ...,a n £ 
C with ai 7^ aj for i 7^ j. Then, e“ iaj ,..., e a,l “ are linearly 
independent for 00 £ LI. 

Proof: We will prove the theorem by induction over n. 
The case n = 1 is trivial. Start with n = 2. If e aikJ and e° 2UJ 
are linearly dependent, then there exist Ci, C 2 £ C such that 

cie“ lW + c 2 e a2 “ = 0 Vw £ Cl 

which implies that 

Cie ( ai -a 2 ) u = _ C2 Vw e n 

In order for the equality above to hold for more than one value 
of oo , and since a \ 7^ we must have ci = c 2 = 0. Now 
let n = k — 1 and suppose that e aitJ , ...,e ak ~ lUJ are linearly 
independent but e“ iaj ,..., e akLJ are linearly dependent. Then, 
there exist ci, ...,Cfc £ C with Ck 7 ^ 0 such that 

c ie aiU1 + ■ ■ ■ + c k e akU =0 Vw£U. 

Multiplying the right and left hand sides of the equality above 
with e~ akU1 gives 

Cie (a!-a fe V + . . , +Cfc _ ie (a fc _ 1 -a*)«*' + c fc =0 Vw £ Cl (17) 

Since int(fl) / 0, we may take the derivative of the left hand 
side of ( | 1 1\ for oj £ int(fl). Taking the derivative of the 
equality ( |17| > gives 

(cti - a k )c ie (ai_afe) “ q-h 

{a k -1 - afe)c fe _ie (afc - 1_afc) “ = 0, Vw £ int(fl). 

Now multiply the right and left hand sides of the equality 
above with e“ fe “: 


for all oo £ (— 2-kW, 2ttW), which implies that 
Cl e~ iulTl + ■ ■ ■ + c n e~ iuTn =0, Vw £ 2ttW). 


Now letting a k = —ir k , for k = 1 and using 

Proposition [ 9 ] with Cl = [—27TB 7 , 2nW], we see that e~ lulTk 
are linearly independent so we must have (ci,..., c n ) = 0 , a 
contradiction. Therefore h\,...,h n must be linearly indepen¬ 
dent, which concludes the proof. □ 


B. Proof of Proposition [7] 

First, note that we may take T = 1 without loss of 
generality. Next, introduce the following two linear projection 
operators. Let D : £ 2 (R) —► £ 2 (R) be the orthogonal 
projection onto the set of time limited function 


<“><*) = {„ Y(f) ‘" [M 
I 0 otherwise, 

and let B : £ 2 (R) —> £ 2 (R) be the orthogonal projection 
onto the set of band limited functions with frequency support 

[-W, IV] 

/ oo 

X(t) sinc(2IV(f — r))dr. 

-OO 

We now have the following lemma. 


Lemma 3. The closure of the range of BD is equal to the 
range of B, i.e., Range(BD) = Range(B). 

Proof of Lemma [i]' Note that Range(BD) C Range(B) 
since Range(B) is closed. To show that Range(BD) D 
Range(B), assume that X{t) = Range(B) n Range(BD) ± . 
By construction, we have that 


(X, BDY) = 0 for all Y £ £ 2 (R), 


and hence 


(«i - a k )cie aiU] H-h (a fc _i - a k )c k -ie ak - lU1 = 0 


(BX,DY) = (X, DY) = 0 for all Y £ £ 2 (R), (18) 







since B is an orthogonal projection and is thus self-adjoint. 
Now ( [T8| holds only if X(t) = 0 on t £ [0, n], which is 
only possible if X = 0, since X (t) is band limited. Therefore 
Range(BD) DRange(B), and the lemma is complete. ■ 
By Lemma |3j we may pick Y £ £'i(R) such that X = BDY 
is arbitrary close to the desired band limited function. Without 
loss of generality such Y may be chosen with support in [0, n]. 
Next, let 

r P k/(2W) 

Ak= Y(t)dt, for k = 1,..., m = [n/p\. 

Jp(k-X)/{2W) 

With this constmction E^Li Aj.6(t — p t f^) — > Y weakly as 
p —> 0. Since the total variation norm is uniformly bounded 
ELi \ A k\ < mUi we have that X{t) = EfcLi A kh k {t) -)• 
X as p —t 0, and the proof is complete. □ 


Proof of Proposition [S] 

The first statement is obvious by using 77 . = r ■ (k — 1) and 
direct calculation of 4D- 

To prove the second statement, we consider 

[H n ] k t = c k —i = ^f f(z)e~ i ^ z dz, 

but we also know that 

/ OO 

hk(t)hi(t)dt 

-OO 

1 /*°° _ 

= — / JT(w)e- i ( fc_1 ^"JT(w)da; 

2n J -OO 

(^)| 2 e~ i(k ~ 1)z dz. 


I 


H l 


The integral is absolutely convergent since h £ £ 2 (R), which 
allows us to do some further manipulations of that expression. 

poo -1 o 

1 -i(fc- l ) z dz 




= E 

£=— 00 ' 
00 

= E 

£=—o 


e 

,ir(2«+l) 1 
tt(2£— 1) T 

1 H 


H 


(^l 2 e~ i{k ~ 1)z dz 


z + 


e —i(k—l)z—i2ivke ( ^ z 


0 -i(k-l)z 


CO 


H 


z + 27rf 


dz. 


This shows that c k ~ 1 is the (k l):th Fourier series coefficient 
of the function 1 Efc -00 \H ( z+ ^^ ) |", and the result now 
follows from the uniqueness of Fourier series. 

Finally to show that / £ £i([—7r, 7r]), we use Proposition [ 3 ] 
and Proposition [ 6 ] to get that f(z) > 0. Hence, |/(z)| = f(z ) 
and 


\f(z)\ dz = c 0 



dz < 00 , 


since h is in □ 

Remark. It should be noted that the associated function 
/ is in £i([—7t, 7r]), and hence the formulation f(z) = 


t Etl -00 \dd ( z+ ^ ( ) f is strictly speaking only valid for 
z £ [—7r,7r]. However, the sum is such that it provides a 
periodic extension of / from [— 7r, 7r] to the whole of R. 

Proof of Theorem [7] 

For ease of notation, we will without loss of generality 
consider a symmetric transmission, that is 

n 

X(t) = Y, A kd(t - kT') , 

k=—n 

_ 1 

where A is the vector of precoded symbols, A = H 2r f +1 X. 

First, we know from Proposition [8] that 11 2 , 1 +] is Toeplitz 
with associated function 

1 OO 

/(*) = E 

£= — 00 

For our particular choice of T' we have that G = 0 

for £ fz 0 and z £ [—7r, 7r]. Thus, for z £ [— 7r, 7r], we have 

m = fG(f)k 

It’s easy to see that /(z) is 27r-periodic and it’s enough to 
consider the interval z £ [—7r, 7 t]. Now we will show that \Jf 
indeed has an absolutely summable Fourier series, which will 
allow us to use Proposition [d] in the analysis of H 2r f +1 . We 
have that 



and since |G(w)| is piecewise continuously differentiable, so 
is y/f. Thus, we know that \JJ has a Fourier series whose 
coefficients converge absolutely [|2T Theorem 7.21] and hence 
belongs to the Wiener class. 

Propositions [3] [4] and [ 6 ] together with the definitions of 
G(uj) and T' assures us of the existence of H 2r f +1 . Now let 



( Ci,o 

Cl ,n 

Cl,2n \ 

1 

H 2 — 

ii 2n+l — 

Cn+1,— n • • • 

Cn+1,0 

Cn+l,n 


\ C2n+l,-2n 

C2n+1, —n 

C2n+1,0 / 


Although the notation might indicate otherwise, the matrix is 
actually symmetric and the reason for numbering its transpose 
will be apparent later. By Proposition [4] the matrix is asymp¬ 
totically Toeplitz, and tends towards 
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CO 

Cl 
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C-2 

C-l 

CO 

Cl 
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C-3 
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CO 

Cl 
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with associate function /, as n —► oo. We have already 
established that yff fulfils the conditions of Proposition [8] and 
thus / = 1 /yff. The signal X(t) can now be written as 


m = xt (. hJ +1 ) j 


( g(t + nT') \ 

g(t + (n-l)T') 

9(f) 

g(t- (n — 1)T') 

V g(t - nT’) ) 


(19) 


By inspecting ( p~9| >, we can see that each uncoded symbol 
will be carried by an effective pulse ,e^ 2 " +1 ) (t) such that 

n 

x(t)= x,if n+ 1 \t). 

£=—n 

The pulse ^ 2n+1 \t) is given by 

n 

tf n+1 \t)= ]T c i+n+1 , fc _, g(t-kT'), 

k=—n 

or equivalently 

n—l 

Cp" +1) (£) = °i+n+i,k g(t -(k- l)T'). 

k=—n—l 

Taking the Fourier transform of £ ( 2 ” +1 ) (t) gives 

n—l 

r-,(2n+l)/ \ _ \ itojT' \ ' —ikujT' 

'~‘l x^) — (jryidje J ^ Cn+l,/c & 


k=—n—l 


Taking the limit n —► oo, we get 




where 


=/(w) = G(co)e ieuT ' ^2 c k e 


—ikujT' 


k ——oo 


= G(w)e‘ &T 7K) 


G(u 


n iluT' 




)(<*>) = 


Proof of Lemma [2] 

The lemma is proved by direct calculations. The assumption 
p < 1 is a technical one which ensures (1 —/3)p > 2 —(1 +/3)p, 
and with (3 £ [0, 1] it also implies (1 + /3)p < 2. 

We have that the Gramian for the root-raised-cosine pulses 
is simply given by a raised cosine with roll-off /3 and that is 
sampled at integer instances and with T = 1/p; namely 

r tt i -ii ow cos (Tr/3p{m - £)) 

[i?nW = psmc (p(m - £)) x _ 4 /3 2 p 2 ( m _ £)2 ■ 

This can in turn be rewritten as 

\Hn\m£ = 

(1 + / 8 ) p 


2 1 — 4/3 2 p 2 (m — 

( 1 -/ 3 ) P 


2 • sinc((l + /3)p(m - £)) 

—2 • sinc((l — P)p{m — £)). 


2 1 — 4/3 2 p 2 (to — c. 

We will use one of Parseval’s identities for periodic convolu¬ 
tion, namely 

OO 1 /* 7T 

/(z) = c k c l elkz = 7 ^ j gi{t)g 2 (z-t)dt. (20) 

Thus we identify the sought associate function / as the 
convolution of the two functions gi(z) and g 2 (z) with the 
corresponding Fourier series coefficients: 

(1 + P)p 


4 =- 


and 


(1 -P)P 


4 = 


sinc((l + £3)pk)+ 
sinc((l — /3)pk) 


-1 


( 21 ) 


( 22 ) 


4 /3 2 p 2 k 2 - 1' 

We can then based on ( |2Tj ) conclude that g\ is a sum of 
rectangular functions, periodic on the interval [— n, 7 r] and 
looks like: 

if (1 + /3)p < 1 
if (l+p)p>l, 


9i(z) = 


((24 ) 
d25 


(23) 


with 


1 


rect 


and 


We conclude that the spectrum of £j 2n+1 ^(£) as n goes to 
infinity is given by 

VTe u “ T ' |w| < 2itW' 

0 U > 27 tW', 


2(1 + /3)pn 


1 — - rect 
1 

+ - rect 


\ iect (w 4 ^ 


2(2 — (1 + /3)p)n 

z 


(24) 


(25) 


of which the envelope is the spectrum of a sine pulse with 
linear bandwidth 2 W'. Since the pulses are transmitted with 
a time spacing of T' = Tyfr seconds and since we know 
that a signal consisting of sine pulses of bandwidth 2 W' and 
time shift spacing T' = f v , are well localized in time, we 
conclude that the signal X(t) is well localized in time. □ 


,2(1 - /3)ptt, 

The function g\ is hence different depending on the value of 
the parameter combination (1 + /3)p. 

Evaluating g 2 (z) we try to find the Fourier series of the 
following function 

<72(z) = a ■ sin ) z e [-7T, 7r], (26) 




















and find that it has the following coefficients: 

~2 — 1 
Cfc = 


4 b 2 k 2 - 1 

2 ab f d\ 2ab(—l) n (n — d 

7r \ 2 b J 7 r \ 2 b 


92{z) = 


1 




cos 


m - n 

2 Pp 


we solve 

rb 


(27) 


2/3p 


((30) x > b 


cos | J '} —— ] df = ^ ( |3TT ) a < z < b . 


z < a 


With 


Comparing c 2 with c 2 , that is ( [27] ) with ( [22] >, we can identify 
that in our case we have b = (3p, d = (1 — /?p)7r, and a = • 

co S ((i-gp) 7 r/( 2 / 3 p)) ' Makin § some rearrangements we arrive at 


nn . . Z — 7T — b \ . / z — 7T — a 

2 pp — sin I ——- ) + sin 


2 (3p\ sin 


2 Pp 

— z —7 T + b 

2/3 p 


sin 


2/3 p 


z — 7T — a 

2/3p 


(28) 


+2 sin ^ 

V 2/3p y 


(29) 


(30) 


(31) 


Now we can use Parseval’s identity, l(20|), to retrieve /. 
Getting a closed form expression for the convolution is made 
easier since g\ is constant over intervals. However since <j\ 
changes depending on the value of (1 + /3)p and g 2 contains 
and absolute value \z — t\ where t is the integration variable 
the problem grows combinatorially. As an intermediate step 


2 f3p sin 


2/?p 


— sm 


2(3p 


(32) 


Combining the results ( |23j ), ( |28] l and ( |29[ i, and doing some 
simplifications to the expressions gives the sought associate 
function found in (| 1 9|. □ 
















